Working papers in speech recognition
نویسندگان
چکیده
This report represents a collection of papers published in various conference proceedings that are not readily available for researchers working in the field of speech recognition. The papers reprinted are: 1. Reddy — Speech Input Terminals (June 1970). 2. Reddy, Erman, and Neely — The CMU Speech Recognition Project (October 1970). 3. Erman and Reddy — Telephone Speech (August 1971). 4. Neely and Reddy — Noise in Speech (August 1971). 5. Reddy — Speech Recognition: Prospects (August 1971). 6. Reddy, Bell, and Uulf — Speech Recognition in a Mul tiprocessor Environment (December 1971). 7. Reddy, Erman, and Neely — A Mechanistic Model of Speech (April 1972). The authors would like to thank Allen Newell who has read most of the papers appearing in this report and made several valuable comments on them. Ue would like to express our appreciation to Bunny Kostkas and Heather Shoub for their typing, editing, and modification of the manuscript using the PDP-10 XCRIBL Document generation system.
منابع مشابه
Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملSession 12: Speech Recognition II
In the first paper, Jim Glass sketched some of the avenues pursued in the MIT SUMMIT work. They have been working on input normalization, different segmental representations, and issues in experimental phonetics. The auditory model requires careful normalization (a condition also explored by IBM in several ICASSP papers), and a non-linear adaptive technique was described without results. Bounda...
متن کاملStatistical Variation Analysis of Formant and Pitch Frequencies in Anger and Happiness Emotional Sentences in Farsi Language
Setup of an emotion recognition or emotional speech recognition system is directly related to how emotion changes the speech features. In this research, the influence of emotion on the anger and happiness was evaluated and the results were compared with the neutral speech. So the pitch frequency and the first three formant frequencies were used. The experimental results showed that there are lo...
متن کاملPersian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
متن کامل